Distributional vectors encode referential attributes

نویسندگان

  • Abhijeet Gupta
  • Gemma Boleda
  • Marco Baroni
  • Sebastian Padó
چکیده

Distributional methods have proven to excel at capturing fuzzy, graded aspects of meaning (Italy is more similar to Spain than to Germany). In contrast, it is difficult to extract the values of more specific attributes of word referents from distributional representations, attributes of the kind typically found in structured knowledge bases (Italy has 60 million inhabitants). In this paper, we pursue the hypothesis that distributional vectors also implicitly encode referential attributes. We show that a standard supervised regression model is in fact sufficient to retrieve such attributes to a reasonable degree of accuracy: When evaluated on the prediction of both categorical and numeric attributes of countries and cities, the model consistently reduces baseline error by 30%, and is not far from the upper bound. Further analysis suggests that our model is able to “objectify” distributional representations for entities, anchoring them more firmly in the external world in measurable ways.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Mapping conceptual features to referential properties

We report on initial work on bridging the conceptto-reference gap using distributional semantics. Specifically, we aim at predicting properties of countries, using distributional vectors to infer database information. Our results are highly encouraging, since we achieve an error reduction of 30% over the baseline and are not far from the upper bound.

متن کامل

A Neurobiologically Motivated Analysis of Distributional Semantic Models

The pervasive use of distributional semantic models or word embeddings in a variety of research fields is due to their remarkable ability to represent the meanings of words for both practical application and cognitive modeling. However, little has been known about what kind of information is encoded in text-based word vectors. This lack of understanding is particularly problematic when word vec...

متن کامل

Are Distributional Representations Ready for the Real World? Evaluating Word Vectors for Grounded Perceptual Meaning

Distributional word representation methods exploit word co-occurrences to build compact vector encodings of words. While these representations enjoy widespread use in modern natural language processing, it is unclear whether they accurately encode all necessary facets of conceptual meaning. In this paper, we evaluate how well these representations can predict perceptual and conceptual features ...

متن کامل

نقشه سازی و مروری بر آنوفل های ناقل مالاریا در ایران

Introduction:Mapping distribution of endemic diseases with their relations to geographical factors has become important for public health experts, especially in the study of vector-born protozoan diseases with emphasis on spatial or geographical epidemiology. This study was carried out to provide distribution maps of the geographical pathology vectors of Malaria in Iran.  Methods: A systemat...

متن کامل

Referential Nets With Attributes

ODe of the essential problems in natural language production and understanding is the problem of processing referential relations. In this paper I describe a model for representing and processing referential relations: referential nets with attributes. Both processes (analyzing and generating referential expressions) are controlled by attributes. There are two types of attributes, on one hand, ...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2015